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ABSTRACT 


In this paper a discussion and re-analysis of experimental 
data is carried out. The data consist of measurements of the 
performance on two tasks (verbal and svatial) that are performed 
by the two different brain halves. The original experiments 
and analysis had been conducted by R. Klein and R. Armitage 
in the investigation of ultradian rhythms in humans. The 
period of the ultradian cycle had been hypothetically equated 
to the period of REM-sleep occurrence. In the re-analysis, 
previously unmentioned inhomogeneity was discovered and the 
findings by the original experimenter-analysts could not be 
confirmed at the stated significance levels. A discussion of 
and recommendations for the experimental set-up are included. 
The re-analysis was conducted with non-parametric methods. 
Small ultradian rhythmicity was concluded to have been induced 
by influence of circadian rhythm, experimental set-up and 


inhomogeneity among subjects. 





as. 


TABLE OF CONTENTS 


DISCUSSION ------------------------------------------ 9 
A. INTRODUCTION ------------------------------------ 9 
B. DEFINITION AND EXPLANATION OF CYCLES ------------ 10 
C. CIRCADIAN AND ULTRADIAN RHYTHM RESEARCH 

DURING SLEEP ------------------------------------ 12 
D. ULTRADIAN RHYTHM RESEARCH DURING WAFEFULNESS ---- 13 
E. ASSOCIATED RESEARCH PHENOMENA ------------------- 14 
F. IMPACT AND IMPORTANCE OF RESEARCH --------------- 15 
G. KLEIN'S AND ARMITAGE'S [1976] EXPERIMENTAL 

SET-UP ------------------------------------------ i 
H. CONSIDERATIONS ABOUT AN ADVISED EXPERIMENTAL 

SET-UP ------------------------------------------ 19 
I. DISCUSSION OF EXPERIMENTAL SET-UP --------------- 19 
STATISTICAL ANALYSIS -------------------------------- 29 
A. INTRODUCTION ------------------------------------ 29 
B. EXPERIMENTER'S ANALYSIS ------------------------- 30 
C. CRITIQUE OF EXPERIMENTER'S ANALYSIS ------------- 31 
D. PLOT OF RAW DATA (PERFORMANCE SCORES) ----------- 32 
E. FRIEDMAN'S TWO-WAY ANALYSIS OF VARIANCE --------- 34 
F. KENDALL'S COEFFICIENT OF CONCORDANCE ------------ 38 
G. FOURIER ANALYSIS OF RAW DATA (SASE VI) ---------- 40 
H. TEST FOR FLAT SPECTRUM -------------------------- 47 
I. KENDALL'S TAU ----------------------------------- 49 
J. SPEARMAN'S COEFFICIENT R OF RANK CORRELATIONS --- 51 
K. RANK SUM MINUS EXPECTED RANK SUM UNDER H.-------- 55 





L. LAG SHIFTING OF RANKS --99ee%%2%3-2 220-00 ---------- 
III. CONCLUSION --------- 2-9-9299 9 = 
LIST OF REFERENCES ----9- 299999 nr rrr re 


Pieri AG DISTRIBUTION List 





Piomeeor FIGURES 


READING ERRORS OF GAS METER INSPECTORS, COMPILED 
OVER 19 YEARS -------<-2-<<<----------- === == -- - - + - oS 


EFFECTS OF TWO MEALS (ABOVE) AND OF FIVE MEALS 
(BELOW) --------------------------------------------- 26 


WORKING TIME, FOOD INTAKE AND READINESS TO 
PERFORM << -<— —--— 2-2 2 oo oo a wn a wn + a 


PLOT OF VERBAL SCORES VS. SPATIAL SCORES: 
Peon ea eos nha oUbanCl NO, > =————--------— Si 


GRAPH OF SMOOTHED PERIODOGRAMS FOR SPATIAL SCORES 
OF THE TWO SUBGROUPS (A&B) AND FOR ALL SUBJECTS(C) -- 44 


GRAPH OF SMOOTHED PERIODOGRAMS FOR VERBAL SCORES 
OF THE TWO SUBGROUPS (A&B) AND FOR ALL SUBJECTS ----- 46 


PLOT OF THE RANK SUMS OF BOTH VERBAL AND SPATIAL 
RANKS AT EACH INTERVAL MINUS THE EXPECTED RANK 
SUM UNDER KLEIN'S HYPOTHESIS OF OUT-OF-PHASENESS ---- 56 


PLOT OF THE PEARSON PRODUCT-MOMENT CORRELATION 
FOR UP TO 15 SHIFTS IN POSITIVE AND NEGATIVE 
DIRECTIONS, RESPECTIVELY ---------------------------- 58 





af 


ei] 


ey 


VI 


ET 


Feed. T 


HEX 


x 


LIST .OF Tab Les 


CORRELATION COEFFICIENT OF VERBAL/SPATIAL 
PERFORMANCE SCORES -------------------------------- 


TWO WAY LAYOUT OF SUBJECTS AND THE RANKS OF 
SPATIAL TEST SCORES WITHIN EACH SUBJECT -----2------ 


TWO WAY LAYOUT OF SUBJECTS AND THE RANKS 
Oe iS eee Se So CORo NEG EY EACH SUBJECT --------- 


Pie ieen THE RIEDMAN STATISTIC FOR THE 
Poston tie POTHE StS OF NO ODEFFERENCE 
BETWEEN TREATMENTS (INDEPENDENCE OF SUBJECTS) ----- 


KENDALL'S COEFFICIENT OF CONCORDANCE AND 
teva ho lie Y FOR SEGNETFICANCE TESTS ------- 


THE PERIODOGRAM VALUES FOR EACH SUBJECT 
AT EACH FREQUENCY f£.=3/n; j=1,2,...,(n/2)-1 
FOR THE SPATIAL SCORES ---------------------------- 


THE PERIODOGRAM VALUES FOR EACH SUBJECT 
AT EACH FREQUENCY ee elie. ep m7 2) 1 


FOR THE VERBAL SCORES ------9-9---9-0--9--2--------- 


KOLMOGOROV-SMIRNOV TEST RESULTS FOR DIFFERENT 
P-LEVELS ------------------------------------------ 


RANKS OF VERBAL SCORES AFTER THE RANKS OF 
SPATIAL SCORES HAVE BEEN ARRANGED IN ASCENDING 
ORDER FROM 1 TO 32 (LEFT HAND COLUMN) ------------- 


LISTING OF P IN KENDALL'S TAU COMPUTATION AND 
STATISTIC L=K/SIGMA(K) FOR SIGNIFICANCE TEST ------ 


SPEARMAN'S COEFFICIENT OF RANK CORRELATION 
AND CONVERSION TO STANDARD NORMAL RANDOM 
VARIABLE ------------------------------------------ 





irae roc esco LON 


eee LNT RODUCTION 

An awareness and interest in various cyclic phenomena is 
not a recent development. Man, since his earliest existence, 
has been aware of cyclical influence in his environment: day 
changed to night, only to repeat itself about 12 hours later; 
Spring, summer, fall and winter followed each other continuously; 
the recurrent changing of the moon, ebb and flow of tides and 
the female menstrual cycle were obvious. The Greeks, however, 
were perhaps the first to actually apply a knowledge of cycles 
in an attempt to better understand their environment. Luce 
[1970] suggests that the Greeks applied their knowledge of 
cycles in the treatment of certain illnesses over 2000 years 
ago. 

This knowledge and interest was obscured during the period 
known as the Dark Ages. At the end of the 19th century a 
theory of biological cycles known as biorhythm rekindled the 
interest and stimulated research in the area. During the 20th 
century the interest has intensified with more cycles being 
identified and serious research activity devoted in an attempt 


to understand the nature of the various biological cycles. 





pee Lee ENETION AND EXPLANATION OF CYCLES 

A cycle is a course or series of events that recur regularly 
and usually lead back to the starting point. [Webster, 1976] 
Meemematicaily, a function 1s cyclic with period T if 
X(t+kT)=x(t) for k=0,-1,72,... 

A rhythm is a regularly recurrant quantitative change in a 
biological process. [Webster, 1976] 

A rhythm - as defined above - is produced by oscillators; 
these can be either linear or non-linear oscillators, thus 
causing linear (e.g. Sinusoidal) or non-linear (e.g. relaxation, 
"saw tooth") oscillations. 

Just as the cycles observed in man's environment since 
ancient times occur in different frequencies, the periods 
Meeover frequency) of rhythmic activities in biological 
systems vary from seconds or minutes (e.g. in cell growth 
and division) to hours (e.g. the 24-hour cycle of the circadian 
rhythm) to days or weeks (e.g. the 28-day clcle of female 
menstruation). 

The terminology of cycle research is grouped around the 
most odvious one, the cycle with a period of "about a day" 
(about 24 hours). Dr Franz Halberg coined the term "circadian 
Beem for this from Latin “circa - about" and "dies - a day.” 
Cycles with periods shorter than a day are called ultradian, 


those with periods longer than a day are refereed to as 


iticadian cycles. 
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There are essentially two different possible explanations 
of rhythmic activities in biological systems. The first is 
that it is an essential dynamical feature of the process 
observed, 1.e. it is part of the process and occurs within 
it at a certain place regardless of when and where the whole 
process takes place. The second is that such rhythms represent 
adaptive responses of the organism to a periodic environment; 
1.e. if the organism were not or had not been exposed to the 
periodicity (1.e. entrained by zeitgeber, a periodic clue- 
giving environmental feature), it would not show the rhythm. 
[Oatley and Goodwin, 1971] 

An example for the first possible explanation - a dynamical 
feature regardless of the environment - is found in cell growth 
and division. If the cell division occured when triggered by 
the environment, it could easily happen that the resulting 
daughter cells would not receive a full set of the hereditary 
material (DNA), would then be deficient in genes and would not 
be able to survive. Considering the enormous number of cell 
divisiors it takes to produce a new life, the probability of 
a healthy being to result from a cell division process that 
was environmentally induced is very small. 

The second possible explanation - entrainment by zeitgeber - 
appears to account for all those cases that seem to have 
Originated as a result of adaptation to a periodic environment. 


The most often discussed and rather obvious zeitgeber is the 
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22-hour day, held responsible for entrainment of the circa- 
dian rhythm; another is the 12-hour tidal cycle. [Its 
Sntrainment has been studied in marine (littoral) organisms. 

A most difficult to explain phenomenon occurs when the 
Frequency of one internally caused cycle falls into the range 
of an environmentally induced cycle. The resulting complicated 
periodic organization then has to be taken as partly adaptive 
and partly of internal orgin. [Oatley and Goodwin, 1971] Bruce 
[1965] demonstrated that this is often the case for cell 
division of multicellular organisms. Generation times were 
found to converge in the neighborhood of 24 hours or a 
Multiple thereof. Interaction of this kind may also occur 
when there are simple multiple relations between frequencies 


like the 24-hour cycle and the 28-day cycle. 


C. CIRCADIAN AND ULTRADIAN RHYTHM RESEARCH DURING SLEEP 
While psychological and physiological phenomena of the 
circadian as well as of the infradian rhythm have been 
researched extensively [Kleitmann, 1949; Harker, 1958; 
meevom:, 1963; Lobban, 1965; Colquhoun, 1971], ultradian 
rhythms have only been studied during sleep-phases or in 
relation to sleep. Ultradian rhythms are those with a period 
shorter than 24 hours, a frequency thereby of more than one 
per day. With the use of electroencephalograph (EEG) that 
records voltages of the biological system through sensors 


connected to various areas of the skull, the depth of sleep - 
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among other things - can be measured. Not only have different 
levels of sleep been identified but it has also been found 

that one of the levels recurs in a cycle of about 90 minutes. 
This level or stage is the rapid eye movement stage (REM-stage), 
named after its physiological phenomenon. Another name for it 
is dream-sleep-stage, since it is during this stage that a 
person dreams. The REM cycle is the best researched of all 
ultradian cycles in animal and human observation. Hartmann 
[1967] quotes the average lengths in minutes as follows: 

mouse (3-4), rat (7-13), rabbit (24), opossum (17), cat (20-40), 
monkey (40-60), man (80-90), elephant(120). Kleitmann [1967] 
has attributed it to the change in metabolic rate. Be this 

as it may, the REM cycle during sleep is an excellent example 


of an ultradian rhythm. 


D. ULTRADIAN RHYTHM RESEARCH DURING WAKEFULNESS 

These findings accepted, the logical question to ask was 
whether a continuation of the REM cycle during the wakeful 
state could be hypothesized and observed. Kleitmann [1963] 
Suggested that the basic rest-activity cycle of 80-90 minutes, 
as seen in the recurrant stages associated with dreaming in 
actual sleep, may persist during the waking period also, 
manifesting itself in recurrant fluctuations of alertness. 
ferqunoun, 1971) 

From there on it was only a question of time until 


scientists would look for parallels between the - by now well 
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researched - field of sleep and that of wakefulness. [In 1922 
a group of subjects had been kept in bed all day and a roughly 
90-minute rhythm of body movement and stomach contraction had 
been observed. [Luce, 1970] Oral activity (eating, drinking, 
smoking) was found to increase and decrease with a range of 


85-110 minutes with a mean of 96 minutes. [Friedman, 1965] 


E. ASSOCIATED RESEARCH PHENOMENA 

Scientific biological experimentation has always focused 
on animals. Hundreds of studies have been conducted observing 
anything from fruit flies to mice to monkeys. While the 
relative availability, short life cycles and possibility of 
complete destruction made observation and evaluation in 
animals easier than in human experiments, it also seems to 
have led astray many scientists and their audiences into the 
belief that primitive creatures certainly followed certain 
instincts and reflexes in leading their somewhat artificial 
lives. Many humans, however, would not readily accept that 
man himself, considered the crown of creation, still followed 
old innate or environmentally entrained oscillators that 
contribute to many of his emotions, actions, reactions and 
physiological phenomena. 

It is also true that biological research in the field of 
cycles and rhythms is most difficult to perform on humans. As 
man subjects himself to externally imposed schedules of work, 
Sleep, entertainment and food intake, the biologically "normal" 
cycle is often disturbed or willfully counteracted (e.g. by 


mrgitt work) . Les 





F. IMPACT AND IMPORTANCE OF RESEARCH 

the amportance of research in the field of rhythmic changes 
in human performance becomes obvious once one realizes how 
many actions, reactions and their effects or evaluations depend 
on human reaction time, accuracy, acuity, steadiness, etc., 
all potentially subject to variation via internal "biological 
clocks." [{Kleitmann, 1949; Harker, 1958; Aschoff, 1963] While 
the circadian rhythm is generally accepted and - sometimes - 
considered in the scheduling of human activities (e.g. night 
shifts, watch bills), an ultradian rhythm has so far not been 
Shown to occur during man's wakefulness and has therefore not 
been considered. 

If it exists and its existence makes a significant difference 
in many or all of the above mentioned human activities and if 
mms Dredictable, the impact of its discovery could be 
extensive. Long term employment and work output may not be 
affected since in the long run (e.g. a day) the ups and downs 
would smoothe out to an average performance level that would 
follow the circadian rhythm. Any short term employment and 
output, however, could be affected significantly. School exams 
of one hour duration could fall either in the improved perform- 
ance period or the degraded performance period of such a cycle; 
reaction to warning signs or signals could be impaired if the 
Signal presentation occurred during the "down period." 

The most significant immediate impact would probably be 


felt in the scientific community where tests of short duration 
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are very common. Medicine and medical research might have to 
revise some of their findings. For example, human reaction 

to drugs might be time-dependent or the effect of drugs could 
be over-shadowed by the effect of the ultradian rhythm. A 
drug that had been thought of as improving reaction time might 
be ineffective and any faster reaction time might have been 
due to an upswing in the ultradian rhythm. The effect of a 
drug that had been found ineffective might not have demonstrated 
desired results because a subject was on a downswing in the 
ultradian rhythm. As for the above mentioned drug experiments 
many subjects are used, it is not very likely that the 
discovery or acceptance of an ultradian rhythm would overthrow 
all findings, but it should be considered in the planning of 
future experiments, especially those that involve only few 
subjects and are of short duration. 

An important question to ask before the above contemplations 
are considered, is whether performance is related to the ultra- 
dian rhythm - should it exist. Hunger (stomach contractions, 
sucking movements of babies), sex (penile erection during sleep 
or very tired wakefulness) and survival (REM sleep "“alertness") 
are all neurologically based in the oldest part of the brain. 
Does performance ever draw from those parts or should it be 


influenced by them? 
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G. KLEIN'S AND ARMITAGE'S [1976] EXPERIMENTAL SET-UP 

Based on the previously mentioned suggestions that a 90- 
minute cycle observed during sleep might continue in some form 
during the wake state, Klein and Armitage [1976] of the 
Department of Psychology at the Dalhousie University, Halifax, 
Canada, designed an experiment to investigate the hypothesis 
that the above suggestion was true. In addition to this 
hypothesis they also investigated previous findings by other 
scientists [Cohen, Gross; Geffen, et al. 1972] which indicated 
that the two brain halves - the right and left hemisphere - 
process different kinds of information. According to those 
findings the right hemisphere specialized in processing visual- 
Spatial, wholistic, non-logical information, while the left 
half specialized in verbal-linguistic, analytic and logical 
processes. Combining these two areas of study the hypothesis 
under test was that performance measurements on each of two 
tasks - to be described below - should reveal rhythmic 
oscillations with a period of approximately 80-120 minutes and 
that equivalent phases of the two rhythms should be separated 
by 180° (one half cycle). 

The experimenters chose two simultaneous matching tasks. 
For the spatial task subjects were required to match dot 
patterns of seven dots each, the verbal task required them 
to match letters. Two dot patterns had to be judged same or 
different and a capital letter was compared to a lower case 


letter and judged same or different. The actual recording of 
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the decision was made by the subject by crossing out S's or 
D's for same or different. The subjects were eight young 
adult volunteers in the age range of 18 to 24; there were 
five females and three males who were right-handed and had 
no left-handed siblings or parents. 

All eight subjects were tested in a group testing room, 
all on the same day. They had two booklets in front of them 
on a desk, one for each task. The booklets contained 48 pages 
with 96 problems to a page. (The booklets were to be used over 
again after completion. An influence of the outcome by using 
the same problems over again was not to be expected since over 
4000 problems would have to be solved before they repeated 
themselves). 

The subjects were told to record their same/different choices 
as quickly as possible without making errors. They were given 
practice on each task to account for a possible learning 
curve (LC). 

Performance was to be measured on each task every 15 minutes 
for three minutes. The test started at 0900 and ran for 8 hours, 
yielding 32 data points (observations) per task per subject. 
Lunch was from 1230 to 1300. The testing was not interrupted, 
but subjects ate in the free minutes during this period. 
Subjects were not allowed to talk to one another but could walk 
around, draw or read. They also reported in writing their 


thoughts at the end of every 15-minute interval. Dependent 
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Variable for the test was number correctly matched pairs in 
each 3-minute test period. This was accepted to reflect the 
subjects’ rate of accurate performance on the matching tasks, 


Since the error rate was low (1.63). 


fee CONSIDERATIONS ABOUT AN ADVISED EXPERIMENTAL SET-UP 

Before listing some general thoughts on experimental set-up 
for the kind of experiment as the one described in this thesis, 
it should be emphasized that by mentioning a consideration or 
recommending a procedure it is in no way implied that Klein 
and Armitage did not consider such question in the conduct of 
maelr effort. A few of the facts from their description of 
the experiment are contrary to recommendations that seem to 
be justified from the study of related research phenomena. 
Since it is impossible to describe an experimental set-up 
within a journal article of reasonable length and mention all 
reasoning behind each element of the set-up, any of the 
following recommendations about the conduct of such an experi- 
ment that are not in direct contrast to the way it actually 
was done should be understood more as an enlargement of the 
background knowledge than as an insinuation that Klein and 


Armitage might have neglected to consider them. 


fil loCUSSION OF EXPERIMENTAL SET-UP 
To set up an experiment with expected outcomes of any 


Significance several principles need to be observed: 
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The number of test subjects needs to be large enough 
to rule out findings by chance or idiosyncracies of 
the subjects chosen. For tests involving humans as 
subjects numbers have been as small as two in many 
experiments. This is very close to the borderline of 
unacceptability. For an experiment for which it is 
anticipated that the findings might be generalized to 
all humans, a representative mix of subjects has to 
be selected. Men, women, young, old,well educated 
and undereducated are some immediate distinctions. 

The last two categorizations might be especially 
Critical when the experiment is a test and the 
variable tested is performance. 

The test subjects are not to be informed of expected 
outcomes, but need to know in general how the test/ 
experiment is to be conducted. Without knowledge of 
procedures and measurements, human curiosity, anxiety 
or even fear could seriously contaminate test results. 
The interaction between personnel administering the 
test elements and the Ss needs to be formalized and 
strictly standardized without the least chance for the 
Ss to "read between the lines" of any comments. 
Otherwise, the Ss will tend to "deliver" the desired 
result to the experimenter. [Rosenthal-effect; Russell, 


377) 
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Humans tend to act differently when they feel they 

are being ovserved than when they think they are 
unobserved. The increase in activity or productivity 
of working people due only to the fact that they are 
being "paid attention to" is known as the Hawthorne- 
effect. In performance-observation experiments this 
effect must not be neglected. 

Constant presence and neutral behavior of the administer- 
ing personnel are of paramount importance, if the 
experiment measures possible performance variation. 

If interspersed absence of the administering personnel 
is necessary, the times and lengths of absence need 

to be randomized and noted in relation to experimental 
outcomes. 

Any regularity during an experiment directed to 
investigate rhythmic phenomena has to be avoided in 
Sec Net oe Imcuce an artifical rhythm or cycle or 
pelanmonre Of  1t : 

Physical separation of subjects would be helpful to 
ensure absolute non-interference between them. In 
tests like the one studied, where booklets have to 

be moved and pages are turned or moved, the simple 
sould of one S$ turning a page may be enough to evoke 


a “competitive spirit" in another. [Blake, 1970] 
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HO 


Immediately connected to avoidance of regularity and 

to separation of Ss, the next aspect of good experimental 
set-up would be temporal separation of the subjects. Not 
only would possible interaction between them be eliminated, 
but also the circadian-rhythm-and external everyday- 
life-cycle-induced regularities would be kept from 
contaminating the test results. 

A different approach to the randomization of the times 
of test would be to use a smaller number of Ss - 

thereby increasing the homogeneity of the Ss - and 

have them perform on very similar tasks elements over 
several 8-hour-time blocks ~- beginning at different 
times - for several days to obtain at least the same 
amount of data points. 

The biggest interference with any test of a length of 
time greater than time between meals, of course 1s a 
meal. Among the meals again it is lunch that seems to 
have the greatest effect on the human daily routine. 
While lower animals can be turned around completely 

and their biological clocks readjusted [Lobban, 1965], 
this is not easily done in humans. A contributing 
factor here are the social constraints that mankind 

has subjected itself to and that are difficult to 

alter. Lunch has become a social function during 

which ideas are discussed, business deals are talked 


over, friends are met, the workplace is left and in 
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many cases the family or home is visited. Especially 
in the last two aspects the lunch break or lunch time 
has become something to look forward to, to work up 
Pe EOmcoOnaleLloneone's mind towards. It cuts the 
working day in half, is something desirable and a 
great motivator. The administrator of an experiment 
that extends through the normal lunch period has to 
be aware of this conditon of the human. 

Besides the psychological phenomenon of looking forward to 
that time of day, a physiological phenomenon exists. By about 
lunch time, breakfast, the earlier meal of the daily eating 
routine, has been digested and the body is running out of 
nutrients to keep working at the same pace as it has been 
during morning; the blood sugar level, for example, reaches a 
low level. [Haggard and Greenberg, 1935] The body is ready for 
a refill. It reminds the person of this fact by a feeling of 
hunger. Following this feeling, the person eats and experiences 
that the blood -eager to assimilate the nutrients from the 
food ~- fills the capillary system of the intestinal digestive 
tract rather than that of the brain. A decrease in the amount 
of blood in the brain area sets in and results in tiredness. 
Instead of following this feeling, however, and taking a siesta 
or a long noon break as is done in southern countries - the 
average person in industrialized countries continues to work 
following lunch. With reduced efficiency and reduced presence 


of mind and responses, the probability of errors, misjudgements, 
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erratic and useless movements increases, often resulting in 
accidents, injury or death; always resulting in reduced 
performance. As for the experimental set-up, the evaluation 
of various studies concerning effects of food intake on 
physical efficiency and mental acuity needs to be taken into 
acocunt. [Bjerner, Holm&Swensson, 1955; Haggard & Greenberg, 
fee) in a long-term study (19 years), reading errors of 
meter readers (in Swedish gas works) were recorded (Figure 1); 
"reading errors were most frequent shortly after the midday 
break, during meal digestion." [Grandjean, 1970] Measurements 
of blood sugar level and respiratory quotient (Figure 2) during 
experimentally varied timings of meals indicated that low 
levels of these two indirect measures of efficiency can be 
avoided by increasing the number of meals to five and spacing 
them less far apart. [Haggard & Greenberg, 1935] [Figure 3] 
These findings have been confirmed by other experimetns. 

Not for performance reasons initially, but for health 
reasons, dieticians have recommended the same timing of meals 
to relieve the body from the extremes of fullness and emptiness 
and have its metabolism instead perform at a rate that is more 
moderate and closer to constant. 

The administering of in-between-meals during an experiment 
researching cyclical phenomena is therefore advisable since it 
will smoothe out physiologically induced variations of 


performance. 
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Any degradation of performance around the lunch time of 


the subjects - as observed in the data under discussion - 


emaould therefore not be attributed to an ultradian rhythm 


without keeping in mind that it is also the expected occurence 


following the circadian rhythm of the average subject. 


FIGURE 1l. 
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READING ERRORS OF GAS METER INSPECTORS, COMPILED 
OVER 19 YEARS. READING ERRORS REACH A MAXIMUM IN 
THE EARLY AFTERNOON (DURING THE DIGESTIVE PERIOD) 
AND DURING THE NIGHT WORK IN THE EARLY HOURS OF 
THE MORNING. [Bjerner, Holm, Swensson] 


ll. A further point of consideration in the design of an 


experiment during which measurements are taken 


repeatedly is the possibility of the subject's 


improvement due to a learning curve (LC). In the 


experiment under discussion the test problems were 


arranged in booklets of 48 pages each with 96 problems 


to a page. A subject, therefore, had to complete well 
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PeeGURE 2. 
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EFFECTS OF TWO MEALS (ABOVE) AND OF FIVE MEALS 
(BELOW). THE RESPIRATORY QUOTIENT CHANGES WITH 
THE BLOOD SUGAR AND CAN BE REGARDED AS AN INDIRECT 
MEASURE OF EFFICIENCY. WITH FIVE MEALS THE VALUES 
DO NOT DECREASE AS MUCH AS WITH TWO MEALS, SO THAT 
EFFICIENCY REMAINS HIGHER THROUGHOUT THE WORKING 
DAY. [Haggard & Greenberg] 
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Readiness to perform 


PeEGURE 3. 
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WORKING TIME, FOOD INTAKE AND READINESS TO PERFORM. 
DURENG WORKING HOURS THE FOOD INTAKE SHOULD CONSIST 
OF TWO SNACKS AND A SMALL MID-DAY MEAL FOR REASONS 

OF HEALTH AND WORKING EFFICIENCY. [Grandjean] 


over 4000 problems before she or he would return to 
the same problem. A LC due to same problems can 
consequently be ruled out. The existence of a LC 

in as far as finding a method to mark the answers in 
the best way was intended to also be ruled out - 
"Subjects were given practice" [Klein] - the data, 
however, indicate a general performance increase 
(increase in number of problems solved) for the 


first few test intervals. 
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Last not least it must be understood that subjects will 
introduce their own bias into an experiment. Any effort which 
requires subjects to react in two ways that determine one 
outcome which is then measured: In Klein's experiment the Ss 
were asked "to make their same/different choices as quickly 

as possible without making errors." [Klein, 1979] This clearly 
burdens them with the responsibility to decide for themselves 
what is meant by "as quickly as possible" when weighed against 
Mveenout making errors." The fact that they did make some 
errors indicates that they really made their same/different 
choices more quickly than possible without making errors. It 
needs to be examined which factor caused them to "accept" an 
error rate of 1.6. Had more emohasis been put on accuracy a 
zero error rate could have resulted, more stress on speed 
could have caused a higher error rate. This difference in 
emphasis can be looked for in the experimental instructions, 
in the way the instructions were given and possibly explained, 
but especially in the Ss' own understanding of the instructions, 
anticipation of a desired result and a personal weighting of 
the respective importance. This individually different trade- 
off of speed versus accuracy influences the outcome and needs 


to be considered. 
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ct oe eo PL eRe ANALYSIS 


A. INTRODUCTION 
The results of the analysis performed by the experimenters 
[Klein and Armitage] are as follows: 

1. There is an ultradian rhythm of approximately 90 
Minutes in the performance data. 

2. Verbal and spatial scores tend to have rhythms at 
this frequency which are 180° out of phase, i.e., 
verbal performance peaks when spatial performance is 
at its lowest level. 

The statistical analysis given in this thesis and which is 
presented in the next sections comes to some distinctly 
different conclusions. 

1. Most importantly the behaviour of the subjects is 
definitely inhomogeneous over the set of subjects. 
It will be shown that there are two sets of subjects 
which, within groups, show homogeneous behaviour in 
Space (ensemble) and time, but between groups show 
disparate behaviour in frequency and time. 

2. The only evidence for non-stationary behaviour in 
time in the two subgroups appears to be a four hour 
cycle, believed to be caused by a lunch-time effect. 

3. There is very little evidence for a strong out-of- 


phase link between spatial and verbal behaviour. 
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The analysis is complicated by the fact that the usual 
assumptions of homogeneity in space and time are not valid, 
and that we are looking at eight bivariate (verbal-spatial) 
processes. Clearly there will be interactions between the 
inhomogeneities. The inhomogeneity between Ss will be 
presented first since it dominates the inhomogeneity in time. 
It should be noted that the data set is not large enough 
to make very fine distinctions in the underlying behaviour 
mechanisms. Thus the conclusions beyond those of subject 


inhomogeneity are tentative. 


B. EXPERIMENTER'S ANALYSIS 

The experimenter converted the 32 scores of each S and 
each task to Z-scores through the process of normalization 
and performed a Fourier analysis. Power spectral peaks 
appeared at 4 hours, 96 minutes and at 37 minutes. Observed 
peaks were Significantly larger than one would expect by 
chance (P .01) when tested through the use of two methods: 
In a "rank test" method the power spectra of each subject 
were replaced by their ranks and then ordered increasing 
from 1 to 16. A mean rank across eight Ss for each period 
was computed. Mean rank above 12.3 were found to be larger 
than expected by chance at the .01 level in a one-tailed 
est . 

In a “randomization test" 500 random orderings of each 


S's scores for each task were subjected to Fourier analysis. 
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feang the mean root power of the random sequences for each 
subject, task and period the probability was determined of 
obtaining, by chance, a spectral value (root power) as large 
or larger than actually observed. Combining the probabilities 
across Ss it was determined for the group which periods had 
consistently larger peaks than would be expected by chance 


at the .0l1l level. 


C. CRITIQUE OF EXPERIMENTER'S ANALYSIS 

The scores are actually quite complex, since they are 
the number of correctly performed tasks in the three minute 
interval. No information 1s available on the actual number 
of tasks attempted. Thus the scores measure a mixture of 
the Ss's speed and error-proneness. 

Standardizing by converting to Z-scores is also open to 
question. This standardization assumes normality. In fact 
the scores are quaSi-count data, so that the mean and variance 
of the scores will not be independent. A Square-root trans- 
formation, both to standardize the variance and induce 
normality would have been preferable. An alternative 
preferred in most of the analysis used in this thesis is to 
replace the data by the ranks within Ss. As will be seen in 
the two-way layout given below, this immediately shows up 
an inhomogeneity in the Ss's behaviour over time. 

The spectral analysis is also subject to criticism. The 


technique is valid to test that the spectrum is not flat, 
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with the proviso that the maximum of the mean ranks must be 
used as the test statistic and the subjects must be 
homogeneous. Alternate tests for each spectrum are given 
below. However, rejection of flatness of the spectrum could 
indicate either that serial correlation is present in the 
Meta (very likely) or that there 1s a cycle, or both. The 
test results given below indicate very different results when 


the two subgroups of subjects are analysed separately. 


D. PLOT OF RAW DATA (PERFORMANCE SCORES) 

The analysis was begun by plotting the raw data. In this 
procedure, the spatial scores were marked along the horizontal 
axis, the verbal scores along the vertical axis. It was the 
goal of this plotting, which is shown in Figure 4, to display 
any correlation between the two elements of the paired test 
scores. 

The Pearson product-moment correlation coefficient con- 
firmed the visually apparent slightly negative correlation 
between the test scores of each of the eight subjects, as 
tabulated in Table I. 

The approximate standard deviation for the coefficient 


in normal samples is n77?=3277 


=0.177. Thus since we are 
looking at 8 of these values, the maximum value of -.48 is 
probably not significant. There is, however, a slight 


indication of negative correlation since all the correlations 


are negative. This question will be returned to later after 
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Mespectral analysis 1S done on the 16 series (8 for verbal, 


fefor spatial). 


verbal 
scores 


250 


150 


100 





Spatial scores 


Pours 4, PLOT OF VERBAL SCORES VS. SPATIAL SCORES; 
REG a Mose vi  PhOl, HERE: SUBJECT NO. 5. 
tieoeeorrUlLED CORRELATION COEFFICIENT IS 
=—Jo°  ROMemanir, . . 






Subject 


Correlation 


TABLE I. CORRELATION COEFFICIENT OF VERBAL/SPATIAL PERFORM- 
ANCE SCORES. 


33 





E. FRIEDMAN'S TWO-WAY ANALYSIS OF VARIANCE 

In the following section a nonparametric analog to the 
two-way analysis of variance procedure is performed. The 
data are presented in a kXn two-way table where k is the 
number of subjects and nis the number of trials (ranks). 
The subjects cannot - as the analysis will show - be 
considered a single, homogeneous random sample because of 
certain relationships between them. The rows in Tables II 
and III indicate subjects, the columns are ranks, i.e. the 
scores have been replaced by ranks within subjects. While 
the row totals are constant (sum of ranks) the column totals 
are only expected to be the same if there is no difference 
in the observations that lead to the ranks. Since the sum 
of deviations of observed column totals from expected column 
totals k(n+l)/2 is zero, the sum of squares of the deviations 
will reveal differences in observations, i.e. homogeneity or 
lack of homogeneity in the population of eight subjects. The 
first thing which is evident from this table is that the 
behaviour of the ranks classifies them into two groups. Thus 
note that at time period 7 subjects 1, 2, 3, 8 have low 
Spatial ranks, while subjects 4, 5, 6, 7 have high ranks. 
Similar opposite behaviour is seen at other time periods, 
e.g. 12, 16, 19, 23, 30. The same effect can be seen from 
the verbal ranks (Table III); in fact, it shows up from the 
verbal ranks in most cases in the same time period as it 


does from the spatial ranks. 
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This observation is crucial. We will now show that for 
overall eight subjects the Friedman statistic shows no common 
effect, but that this changes drastically if the statistic 
(or the equivalent Kendall's coefficient of concordance) is 
applied within subgroups. 

The random variable S, the test statistic for the Friedman 
two-way analysis of variance, under the hypothesis of no 
difference between observations (ranks) therefore has a 
sampling distribution as follows: 


n 2 
2 Peta) 
= 


meamoaoman's statistic F is a linear function of this random 


variable and is defined as: 


RS -~ 3k(n+1) 
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It can be treated as a chi-square variable with n-l degrees 
of freedom as long as n is greater than 7. The results are 
Given in Table IV. The rejection region for the test of the 
hypothesis of no difference in treatments (ranks) with n = 32 
consists of all values greater than 45 or a = .05 and of 

all values greater than 52.2 fora= .01. This means - since 
all F-values of Table IV definitely fall into this region - 


rejection of the hypothesis of no difference at the .05 level. 
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At the .01 level the overall F-statistic for the spatial 


task falls just below the rejection region. 
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Mepoe IV. LISTING OF THE FRIEDMAN STATISTIC FOR THE TEST OF 


THE HYPOTHESIS OF NO DIFFERENCE BETWEEN TREATMENTS 
(INDEPENDENCE OF SUBJECTS). 


F. KENDALL'S COEFFICIENT OF CONCORDANCE 

The above mentioned statistic F provides a possibilitiy 
to test the hypothesis of no difference between "treatments" 
- that results in ranks - at any level a. Actually of more 
importance 1s homogeneity or concordance between subjects. 

A method to describe the concordance between subjects 
and to find a measure of relationship between rankings is 
the computation of Kendall's coefficient of concordance, W. 
This coefficient measures agreement between subjects, 
concordance between rankings or dependence of sample. It is 
analogous to the two-sample measurement of concordance; 
however, it does not range from -1 to 1, but only from 0 tol. 


The reason for this is that among more than two samples 
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peeeeecuraduscordance (aS =—l andicates it in the two-sample 
case) cannot be defined. Zero means no agreement between 
subjects or independence of samples, 1 indicates perfect 
concordance. The statistic W is defined to be the ratio of 
S (as described in the previous section) to its maximum 


possible value Ke (Gaseous, 1971, pg. 251). 


c 


7 eine 
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As can be seen from Table V the complete group of eight 
subjects 1s rather inhomogeneous, while the two previously 
identified subgroups 1, 2, 3, 8 and 4, 5, 6, 7 are much more 
homogeneous within groups. Table V also contains the 
necessary conversion of the statistic W to a statistic Y, 


a test statistic that allows a significance test at level a. 


overall |group 1238 group 4567 





verbal 5 32026 
W ns) 3 
i 90.98 

Spatial | S So 1 32389 30968 
W .19426 27 4205 .70949 
i 48.18 $2701 87.98 


eon, Ve KENDALL'S COEFFICIENT OF CONCORDANCE AND THE TEST 
STATISTIC Y FOR SIGNIFICANCE TESTS. 
Y = k°(n-1) °-W Y = k°31-W 
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This statistic is distributed chi-square with n-l degrees 

of freedom. Ina test of the hypothesis of independence 
between subjects large values of Y should fall in the 
rejection region. For n = 32 and a equal .05, values greater 
than 45, for a = .01 values greater than 52.2 have to be 
considered large. The result shown in Table V is therefore 
the same as that of Friedman's two-way layout discussed 


above. 


G. FOURIER ANALYSIS OF RAW DATA (SASE VI) 

The raw data without having been normalized or smoothed 
previously were subjected to Fourier analysis using the 
available SASE VI - program [LEWIS,1976:. 

The intent of this step of the analysis was to see if 
the power spectra (or amplitude) for the verbal and spatial 
performance scores as shown by Klein could be duplicated. 
“Further it was desired to see if there were effectively any 
peaks in the spectra at any frequencies, and to compare the 
verbal and spatial spectra. The computation is as follows: 
let the set of data be indicated by X., em 2 ee. se ths 
The Finite Fourier transform of this data is 


aaa 7 l2i (2-1) -j/n 
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where i=V-1l . The periodogram, an estimated amplitude 


feectrum Of the data is defined to be 


ee [re 
P (3) = la (3) | 


This estimates the spectral density of the sequence Xo 
a 


given by 


fF+(w) = 7 1+2 Op. cos (jw) ' 


where the serial correlations 05 are defined to be 


(eee, 22, 


Ps = corr (XpXp 15 
We note that f+(w) = = if the data are independent. More 


‘important are the sampling properties of the periodogram 


meints P(4)° These are approximately independent and 
exponentially distributed with mean value f+(w). In 
particular 1f the spectrum is flat f+(w) = = and the 


TI 
periodogram points are approximately i.i.d. exponential. 


Before doing any formal tests for departure from a flat 
Spectrum, it is worth looking at the spectra graphically. 
The spectra are smoothed by averaging over the eight subjects 
at each frequency f (or J), as was done by Klein. The 
numbers are given in Tables VI and VII, and the graphs in 


Figures 5 and 6. Klein's spectra are essentially reproduced 
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and the dominant feature in both is the peak at a frequency 
corresponding to a period of four hours. If this peak were 
Significant, it would correspond to the lunch time effect 
meted in the introduction. 

However, in the previous section we have seen that there 
is an inhomogeneity in the subjects. Therefore smoothed 
periodograms for the subjects 1, 2, 3, 8 and 4, 5, 6, 7 are 
shown for both verbal and spatial tests (Figures 5 and 6). 
The most interesting aspect of this split 1s that the strong 
Similarity between the overall verbal and spatial periodograms 
disappears. In the spatial case (Figure 5) the dominant 4 
hour peak is still apparent. However, the verbal periodogram 
for subjects 4, 5, 6, 7 does not have this peak. In fact, 
the verbal periodograms for the two groups are dissimilar, 
reinforcing the conclusion of heterogeneity in the sample of 
“eight subjects. 

Before any other inferences can be drawn from the 
periodograms, it is necessary to check the significance of 
the departure from a flat spectrum, since the peaks in the 
spectra may be due simply to sampling fluctuations. This is 


done in the next section. 
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FIGURE 5. GRAPH OF SMOOTHED PERIODOGRAMS FOR SPATIAL SCORES 
OF THE TWO SUBGROUPS (a & b) AND FOR ALL SUBJECTS 
(c). IN ALL CASES THE 4 HOUR PERIOD COMPONENT 
(j=2) IS THE MAXIMUM PEAK, BUT IN THE CASE OF 
SUBJECTS 1238 THE 96-MINUTE PEAK IS NOT LARGE. 
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FIGURE 6. GRAPH OF SMOOTHED PERIODOGRAMS FOR VERBAL SCORES 
OF THE TWO SUBGROUPS (a & b) AND FOR ALL SUBJECTS 
(Cae lise DirP ERENCE EN THE SPECTRA OF THE TWO 
DUBGROUPS £S STREKENG. 
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nee TEST FOR FLAT SPECTRUM 

The pooled Fourier analysis performed by the experiment- 
ers [Klein, 1979] yielded a power spectrum in which dominant 
peaks occured at 4 hours, at 96 minutes and at 37 minutes. 
A Kolmogorov-Smirnov test for flat spectrum was applied to 
test the significance of all peaks together. This tests the 
independence of the 32 data points which go into making the 
Spectra. The idea behind this is that since the p(j)'s are 


approximately i.i.d. and exponential, then 


s 
- ep () 


i=l 
I(j) = 
n 
xl 
De 
i=l 
’ i Ly ee 


Maem—oraer Statistics from a uniform (0, 1) distribution 

under the null hypothesis of a flat spectrum. [Cox & Lewis, 
1966] But this is the canonical form of the usual tests of 
goodness-of-fit and thus the Kolmogorov-Smirnov and Anderson- 
Darling tests have been applied. [Cox & Lewis, 1966] The 
results are shown in Table VIII. For the verbal spectra it 
is striking that for subjects 1, 2, 3, 8 the K-S test is 
feomy Significant, while it is not significant for subjects 


4, 5, 6, 7 at any level. Therefore the conclusion might be 
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Ghat as far as verbal tests are concerned there is a strong 
lunch time effect in one case and not in the other. There 
is a much smaller chance that spatial tests were affected by 
the lunch time effect for the group 4, 5, 6, 7. This would 
confirm some opposition for verbal and spatial tests, with 
the qualification that 1t is a subject dependent result. 

It would be difficult to extract from any of these spectra, 
jointly or spearately, any indication of a 90-min cycle in 
the data. The 90-min peak is big when the 4 hour peak is 


big, indicating that one sees a harmonic of the long cycle. 


Subject 


Spatial KS Pe OUS | sO. so0 0.741) 1.376) 1.237 | 0.908 | 1.303 | ----- 


two-sided test 
p=.80 KS=1.032 tf 
p=.90 KS=1.18 
p=.95 KS=1.308 
p=.98 KS=1.464 
p=.99 KS=1.568 


verbal KS ee O el. 439 Poss son Una oe 0.516 | 0.936 | 1.760 


two-sided test 


—h 
> th -h hh -h 
> hh hh -h mh 
> hh mh 


h jh h 


p=.80 KS=1.032 nf f i nf 
p=.90 KS=1.18 nf f f f nf 
p=.95 KS=1.308 f f f f nf 
p=.98 KS=1.464 f f f f nf 
p=.99 KS=1.568 fi f f f nf 





f=flat nf=non-flat 


feoeeeVITT., KOLMOGROV-SMIRNOV TEST RESULTS FOR DIFFERENT 
P-LEVELS. FOR KS-VALUES HIGHER THAN THE ONES 
peoOCLATED WETHSTHE p-LEVEL ON THE LEFT, THE 
Poneto homor sr LATNESS LS REJECTED. AT THE .05 
PE Pion Oli THE CASE POR SUBJECTS 3 AND 
8 IN THE VERBAL SCORES. 
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ZI. KENDALL'S TAU 

The previous sections have indicated some tendency of the 
performance in verbal and spatial tasks - as shown by the 
scores -—- to be negatively correlated and perhaps out of 
phase at some frequency. We will pursue this further here. 
Klein's hypothesis is that a cycle of about 96 minutes exists 
in the performance of a human and that verbal and spatial 
performance are about 180° out of phase, i.e. good verbal 
performance goes together with less good spatial performance 
and vice versa. 

To test the second part of the hypothesis - the out-of- 
phase phenomenon - assuming for the time being that a cycle 
does exist, Kendall's tau coefficient was computed which is 
a measure of association between random variables from any 
bivariate population. 

The reasoning behind uSing Kendall's tau as opposed to 
the Pearson product-moment correlation coefficient (as in 
paragraph D of this section) at this point is that tau is 
invariant under all transformations of X and Y for which 
the order of magnitude is preserved; this allows the use 
of ranks instead of raw data. 

Perfect direct and indirect association between X and Y 
are reflected by perfect concordance and discordance, 
respectively. Concordance or agreement is a comparative 
type of association and means that large (small) values of 


X are associated with large (small) values of Y. [Gibbons, 
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Peco Discordance, the @pposite, associates large 
X-values with small Y-values and vice versa. 

The Null hypothesis in this test is that the X and Y 
are independent. The procedure to compute Kendall's tau, 
also called Theil's statistic, is as follows: Order the 
X, Y-pairs such that the X-values are in increasing order 
(this is shown in Table IX); the ranks of the Xs then run 
from 1 to n (1 to 32 in the experimental data under 
discussion). Now define P as the number of times the rank 
of a Y-value is greater than the rank of any of the following 
Y-values. Define Q as the number of times the rank of a 
Y-value is smaller than the rank of any of the following 
Y-values. P and Q then have to add to the total number of 


comparisons. 


P = # times r(Y,) > r(¥5) 
Cee fecemes = (¥.) = r(Y 5) 
J =~ eral i+2, ae 6! 

_ B(n-1) 
PHQ = 


Now define K to be the difference P-Q. K is the difference 
of two random variables and is distribution-free. kK is 
symmetric with a range of(-n(n-1))/2 to (n(n-1))/2 and an 
expected value E(K) = 0; the variance, sigma squared, equals 


(n(n-1) (2n+5))/18 under the Null hypothesis that X, and Y,; 
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are independent. The reasoning for this is the permutation 
hypothesis which states that under assumption of independent 
identical distribution each of N! permutations of the ranks 
is equally likely. Define T to be K divided by (n(n-1))/2. 


For large n the random variable Z can be treated as a standard 


normal variable. It is a linear function of T and is defined 
as: 
g=-=3%mm@-1) . o 
v2(2m+5 


The results of the test of the hypothesis of independence of 
the x; and Y; is shown in Table XK. Seven out of the eight 
subject results are not Significant at the .05 level, not 
even at the .10 level. The hypothesis of independence is 
therefore accepted. This might be confounded by the 
frequency effect. Thus one could look at lagged correlations. 


This is pursued below in paragraph L of this section. 


J. SPEARMAN'S COEFFICIENT R OF RANK CORRELATIONS 

Another computation performed to be able to test the 
hypothesis of independence between the two different kinds 
of performance scores was finding Spearman's coefficient of 
rank correlation. The proceure is as follows: The X observa- 
tions are replaced by their rank within the X,-group; the 
Y observations are replace by their ranks in the Y;~-group. 


The data then consist of n sets of paired ranks and the 
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Subl Sub2 Sub3 Sub4 Sup Sub6 Sub7 Subs 


1 on 30 oo is 7 18 8 5 

2 28 28 i Sel 31 Sl 3 26 

S Uy 2 28 10 16 GS ee 16 

4 Sie 29 aie 3 14 29 15 eu 

5 i Zi i 24 29 26 2 iol 

é 3 8 bal 14 ie 

i 24 tS 30 9 18 6 7 18 

8 7 24 4 Ze 23 3 18 ZS 

2 29 18 8 ibe, 10 7 ey SZ 

10 30 5 Zs 15 iS 2g 6 aun 
ie 14 10 7 24 9 Zs Ly 

iz 10 1 a 6 6 Ay) iy 6 
ie 26 3c 9 28 20 24 16 10 
14 iS 20 16 26 27 Z1 i + 
We eZ 11 18 8 30 i vio 14 
E : 20 Ai a2 9 2 32 / 

Ly 8 26 24 30 13 30 18: 29 
18 A / 1 18 Sc 16 24 Us 
Cs) I ~ Ze 2 Zs we 29 ey 
20 i 3 7 17 26 Ze 10 ie 
ie Ze 20 14 5 2 2 2 

22 2 16 J ie 28 14 el 25 
23 16 Zi 7 Zo eZ 10 30 28 
24 18 Z 14 4 iy 8 26 a 
29 oo 6 3 16 22 i3 28 : 
: 1 Z3 a1 19 20 4 20 

“aii Zl 8 (5 25 + 28 a ae 
28 9 Us) 26 3 on - 11 13 
29 22 10 6 12 3 ey Z 1 
30 3 oo 19 og i 1 20 8 
31 20 1 2 1 1 Zs 5 20 
oc 14 ta ike 20 2 3 l 24 


TABLE IX. RANKS OF VERBAL SCORES AFTER THE RANKS OF SPATIAL 
SCORES HAVE BEEN ARRANGED IN ASCENDING ORDER FROM 
1 TO 32 (LEFT HAND COLUMN). 
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TABLE X. LISTING OF P IN KENDALL'S TAU COMPUTATION AND 
STATISTIC L=K/SIGMA(K) FOR SIGNIFICANCE TEST. 
ONLY SUBJECT NO. 2 SHOWS SIGNIFICANT SIGNS OF 
DEPENDENCE; OVERALL INDEPENDENCE IS ACCEPTED. 


Do 





coefficient R can be computed. Define R, to equal rank (xX, ) 
and define Ss. to equal rank (Y,); define D; to equal the 
difference of R, and S;. The coefficient R is then defined 
mo De: 


ie 


—-—— 
m(n~-1) 


"R measures the degree of correspondence between rankings, 
instead of between actual variate values, but it can still 
be considered a measure of association between the samples 
and an estimate of the association between X and Y in the 
memeinuous bivariate population.” [{Gibbons, 1971, pg. 226] 
As displayed in Table XI, with the exception of subject 
number 2, none of the subjects shows significant rank 
correlation to reject the hypothesis of independence between 


the verbal and spatial scores. 


Rl: -.19098 ata Z=R(n-1)? 
Ree 3 De AAAS 

Reena 4s eC ee oe 
R4 : -.04857 One std. dev. : .1498 
Ro. re= 2651510 Z: -1.59 

Rone lgos4 Zeta 

Rime IeO 7 7Gue zr 40.16 

Rome 808629 020 


TABLE XI. SPEARMAN'S COEFFICIENT OF RANK CORRELATION AND CONVERSION TO 
STANDARD NORMAL RANDOM VARIABLE. 
Viti Ache r hon Of sUB0RCr NO. 2 NO SUBJECT SHOWS SIGNIFI- 
CANT RANK CORRELATION TO REJECT THE HYPOTHESIS OF INDEPENDENCE 
BETWEEN THE VERBAL AND SPATIAL PERFORMANCE SCORES. 
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K. RANK SUM MINUS EXPECTED RANK SUM UNDER Ay 

Another way pursued to examine the hypothesized out-of- 
phase phenomenon was a plot of the sums of paired ranks minus 
the expected rank sum under Hoe n+l = 33, vs. time period. 
Since under the hypothesis of Klein the performance of verbal 
and spatial tasks should be 180° out of phase. This would 
mean good performance (i.e. high ranks) in one task should 
go together with poor performance (i.e. low ranks) in the 
other. The expected value of this statistic is zero under 
Klein's hypothesis. 

The procedure to develop this statistic is as follows: 
Replace the raw data by their ranks in each group of X and Y. 
Add the ranks of each pair X and Y and subtract 33, which is 
n+l for the data under discussion. If the performance in the 
two different tasks really is out of phase the resulting 
values should be around zero. Very negative (or positive) 
results indicate an overall deviation to poor (or good) 
performance. Negative values can be expected in the begin- 
ning of the testing session due to a learning curve effect 
(LC) as well as later in the experiment due to fatigue. 

Under Klein's composite hypothesis of a 96 minute cycle 
pee OUt-of-phase phenomenon for the performance on the 
two tasks one would expect this measurement to oscillate 
around zero following the 96 minute cycle. The result as 
displayed in Figure 7 shows, however, that - if anything 
at all - it follows a slow wave movement which is presumed 


to be induced by the lunch time effect. 
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FIGURE 7. PLOT OF THE RANK SUMS OF BOTH VERBAL AND SPATIAL 
RANKS AT EACH INTERVAL MINUS THE EXPECTED RANK SUM 
UNDER KLEIN'S HYPOTHESIS OF OUT-OF~-PHASENESS. 
LOW VALUES CAN BE EXPECTED AT BOTH ENDS OF THE 
GRAPH; DUE TO A LEARNING CURVE IN THE BEGINNING 
AND DUE TO FATIGUE AT THE END OF THE EXPERIMENT. 
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L. LAG SHIFTING OF RANKS 

To investigate the out-of-phase hypothesis further, an 
analog to an auto-correlation procedure was devised. After 
replacing the raw scores by their ranks, one column of 
performance socre ranks was kept stationary and the other 
was shifted ("lagged") downwards for negative lag, upwards 
for positive; i.e. in this procedure spatial score (rank) 
position 1 was paired with verbal rank position 2 for 
positive lag 1; spatial position 1 was paired with verbal 3 
for positive lag 2, etc... . For each lag in the range from 
0 to +15 the simple Pearson product-moment correlation was 
computed and plotted as shown in Figure 8. 

The reasoning for this procedure is the following: Under 
Klein's hypothesis of out-of-phaseness one can expect a high 
positive correlation of two performance scores after about 
3-4 shifts (lags). Since the scores were collected every 
15 minutes, 3 to 4 shifts correspond to 45 to 60 minutes. 
With a hypothesized cycle period of 80 to 110 minutes, 
measurements that were initially out of phase would then be 
in phase and should be highly positively correlated. 

The results for all subjects are similar to the repre- 
sentative plot of Figure 8, where a cycle of the hypothesized 
period cannot be found. The symmetry of the correlation plot, 
however, is striking: In all eight subjects the highest 


positive correlation is found after 8 to 10 shifts, indicating 


Dy, 





Meee Goce noe 4-5) tO) > hours. Te could also - which is probably 
more realistic - simply indicate the positive correlation 
between poor scores at the beginning, after about 4 hours at 


lunch time and again 4 hours later due to fatigue. 
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FIGURE 8. PLOT OF THE PEARSON PRODUCT-MOMENT CORRELATION FOR 
elem e oot rto ei POSITIVE AND NEGATIVE DIRECTIONS, 
Risener tvyEnby. THE EXPECTED STRONGLY POSITIVE 


CORRELATIONS AT ABOUT 3-4 SHIFTS CANNOT BE DETECTED. 
Pop teowinneevEoPhOlL, HERE: SUBJECT NO. 3. 
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et CON Ginws Pen 


Considering the impact and importance of research in this 
part of cycle research as described in Section I, paragraph 
F, Klein and Armitage certainly investigated an interesting 
aspect of biological rhythmicity. Based on previous sleep 
research their hypothesis was a logical one. However, based 
on the research and analysis that went into the compositon of 
this thesis, their findings cannot be agreed to wholeheartedly. 
The inhomogeneity in the small group of their subjects seems 
to be a fact that has not been considered before. The influ- 
ence of the 24-hour cycle, the circadian rhythm, on the test 
procedures and their timing seems sufficient to account for 
any rhythmicity detected by Klein and Armitace. 

While further research into a possible ultradian rhythm 
1s necessary and will be interesting, it needs to be evalu- 
ated if the impact of such a cycle ~- should it be found 
conclusively - is worth the effort in time and manpower. It 
also seems as if a useful predictability cannot be estab- 
lished since any possible influence of a possible ultradian 
rhythm is completely overshadowed by the variability among 
subjects. 

It is suggested that a future experiment make use of 
some of the discussion objects of section I, paragraph I, 
that were not obvious to Klein and Armitage's experimental 


set-up. 
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